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Abstract —We consider a wireless channel shared by multiple 
transmitter-receiver pairs. Their transmissions interfere with 
each other. Each transmitter-receiver pair aims to maximize its 
long-term average transmission rate subject to an average power 
constraint. This scenario is modeled as a stochastic game under 
different assumptions. We first assume that each transmitter and 
receiver has knowledge of all direct and cross link channel gains. 
We later relax the assumption to the knowledge of incident 
channel gains and then further relax to the knowledge of the 
direct link channel gains only. In all the cases, we formulate 
the problem of finding the Nash equilibrium as a variational 
inequality (VI) problem and present an algorithm to solve the 
VI. 

Keywords—Interference channel, stochastic game, Nash equilib¬ 
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I. Introduction 

We consider a wireless channel which is being shared 
by multiple users to transmit their data to their respective 
receivers. The transmissions of different users may cause 
interference to other receivers. This is a typical scenario 
in many wireless networks. In particular, this can represent 
inter-cell interference on a particular wireless channel in a 
cellular network. The different users want to maximize their 
transmission rates. This system can be modeled in the game 
theoretic framework and has been widely studied 12 - a 

In ID, the authors have considered parallel Gaussian 
interference channels. This setup is modeled as a strategic 
form game and existence and uniqueness of a Nash equilibrium 
(NE) is studied. The authors provide conditions under which 
the water-filling function is a contraction and thus obtain 
conditions for uniqueness of NE and for convergence of 
iterative water-filling. They extend these results to a multi¬ 
antenna system in a and consider an asynchronous version 
of iterative water-filling in (6j. 

An online algorithm to reach a NE for the parallel 
Gaussian channels is presented in ||2) when the channel gain 
distributions are not known to the players. Its convergence is 
also proved. In a authors describe some conditions under 
which parallel Gaussian interference channels have multiple 
Nash equilibria. Using variational inequalities, they present 
an algorithm that converges to a Nash equilibrium which 
minimizes the overall weighted interference. 

We consider power allocation in a non-game-theoretic 
framework in Q (see also other references in 0 for such 
a setup). In 0, we have proposed a centralized algorithm for 
finding the Pareto points that maximize sum rate when the 
receivers have knowledge of all the channel gains and decode 


the messages from strong and very strong interferers instead 
of treating them as noise. 

All the above cited works consider a one shot non- 
cooperative game (or a Pareto point). As against that we con¬ 
sider a stochastic game over Gaussian interference channels, 
where the users want to maximize their long term average rate 
and have long term average power constraints (for potential 
advantages of this over one shot optimization, see 0 , 0 ). For 
this system we obtain existence of NE and develop algorithms 
to obtain NE via variational inequalities. Further more, the 
above mentioned literature considers the problem when each 
user knows all the channel gains in the system while we also 
consider the much more realistic situation when a user knows 
only its own channel gains. 

The paper is organized as follows. In Section [TTI we present 
the system model and formulate it as a stochastic game. In 
Section [Dj] we study this stochastic game and define the basic 
terminology. In Section [IV] we propose an algorithm to solve 
the formulated variational inequality under general conditions. 
In Section [V] we use this algorithm to obtain NE when the 
users have only partial information about the channel gains. 
In Section [Vl] we present numerical examples and Section 
I VIII concludes the paper. 

II. System model and Notation 

We consider a Gaussian wireless channel being shared 
by N transmitter-receiver pairs. The time axis is slotted and 
all users’ slots are synchronized. The channel gains of each 
transmit-receive pair are constant during a slot and change 
independently from slot to slot. These assumptions are usually 
made for this system 0, (9|. 

Let Hij(k) be the random variable that represents channel 
gain from transmitter j to receiver i (for transmitter i, receiver 
i is the intended receiver) in slot k. The direct channel power 
gains \H u (k)\ 2 £ Ud = {g[ d) ,,•••, 9m} and the cross 
channel power gains \H l 3 (k )\ 2 £ U c = {g[ c \gi c \- . -,9nh- 
Let ltd and n c be the probability distributions on 'H,i and fL c 
respectively. We assume that, {Hij(k),k > 0} is an i.i.d 
sequence with distribution ntj where 7= 7r f ; if i = j and 
7 tij = 7r c if i ^ j. We also assume that these sequences are 
independent of each other. 

We denote ( Hij(k),i,j = 1 by H(fc) and its 

realization vector by h(k) which takes values in fL, the set of 
all possible channel states. The distribution of H(fc) is denoted 
by 7r. We call the channel gains ( Hij(k),j = 1, .. ., N) from 
all the transmitters to the receiver i an incident gain of user i 


and denote by H fik) and its realization vector by hfik ) which 
takes values in X, the set of all possible incident channel gains. 
The distribution of H,(A:) is denoted by 7 r/. 

Each user aims to operate at a power allocation that 
maximizes its long term average rate under an average power 
constraint. Since their transmissions interfere with each other, 
affecting their transmission rates, we model this scenario as a 
stochastic game. 


We first assume complete channel knowledge at all trans¬ 
mitters and receivers. If user i uses power Pj(H(fc)) in slot k, 
it gets rate log (1 + Ej (P (H(/c)))), where 


IMP(H(fc))) 


q t |ff»(fc)| 2 P(H(fc)) 
i + E^I^(fc)l 2 ^(H(fc))’ 


(1) 


We limit ourselves to stationary policies, i.e., the power 
policy for every user in slot k depends only on the channel 
state H(k ) and not on k. In the current setup, it does not 
entail any loss in optimality. In fact now we can rewrite 
the optimization problem in Q to find policy P(H) such 
that ri = En[log (1 + (P (H)))] is maximized subject to 
E h [Pi(H)] < Pi for all i. Similarly, we can rewrite the 
optimization problems in games Qj and Qi>. We express power 
policy of player * by Pj = {Pfih), h £ XL), where transmitter 
i transmits in channel state h with power Pi(h). We denote 
the power profile of all players by P = (Pi,... ,Pjv). 

In the rest of the paper, we prove existence of a Nash 
equilibrium for each of these games and provide algorithm to 
compute it. 


P(H(fc)) = (Pi(H(fc)),..., Pjv(H(fc))) and at is a constant 
that depends on the modulation and coding used by transmitter 
i and we assume on = 1 for all i. The aim of each user i is to 
choose a power policy to maximize its long term average rate 

1 n 

r<(Pi, P_i) = limsup - ]T E[log (1 + T z (P (H(fc))))], 
n ^°° n k = 1 

( 2 ) 

subject to average power constraint 
1 n 

limsup — VE[?,(H(t))] < Pi, for each i, (3) 

n—foo Tl 

k —1 

where P , denotes the power policies of all users except i. 
We denote this game by Qa- 

We next assume that the ith transmitter-receiver pair has 
knowledge of its incident gains H, only. Then the rate of user 
i is 

r^Pi.P-i)^ 

1 " 

limsup - E H a (fe) [ E H_i(fe) [log (1 + T^H^fc), H_i(/c)))]] 

n —>00 11 , 


where Pj(H(fc)) depends only on Hj(fc) and Ex denotes 
expectation with respect to the distribution of X. Each user 
maximizes its rate subject to ©, we denote this game by Qi. 

We also consider a game assuming that each transmitter- 
receiver pair knows only its direct link gain Ha. This is the 
most realistic assumption since each receiver i can estimate 
Ha and feed it back to transmitter i. In this case, the rate of 
user i is given by 


1 X 

7-i(Pi,P_i) = limsup- ^E Hii (fe) [E H _ is (fe) 

n ^°° n k = 1 

[log (1 + Ti{Hu{k), H_u(k)))]\, (5) 

where Pj(H(/c)) is a function of Hu(k) only. Here, H_a 
denotes the channel gains of all other links in the interference 
channel except H lr . In this game, each user maximizes its rate 
0 under the average power constraint ( 0 . We denote this 
game by Q D . 

We address these problems as stochastic games with the 
set of feasible power policies of user i denoted by A- L and its 
utility by r*. Let A = Y\XL 1 Ai. 


III. Game Theoretic Reformulation 

Theory of variational inequalities offers various algorithms 
to find NE of a given game ED- A variational inequality 
problem denoted by VI(K, F ) is defined as follows. 

Definition 1. Let K C E" be a closed and convex set, and 
F : K —» K. The variational inequality problem VI(K,F) is 
defined as the problem of finding x £ K such that 

F(x) T (y — x) > 0 for all y £ K. 

We reformulate the Nash equilibrium problem at hand 
to an affine variational inequality problem. We denote our 
game by G = ((A)ilu (p)ili)> where G(Pi,P-i) = 
E H [log(l+r i _(_P(H)))] and A t = {P, € R N : 

E h [Pi(H)] < Pi, Pi{h) > 0 for all h £ %}. 

Definition 2. A point P* is a Nash Equilibrium (NE) of game 
Q = ((A)ili, (g)(I 1 ) if for each player i 

n{ PJ.PV) > niP^PfJfor all Pi £ Ai. 

Existence of a pure NE for the strategic games Qa, Qi and 
i QD follows from the Debreu-Glicksberg-Fan Theorem ( liTOl . 

page no. 69), since in our game ry(Pj,P_j) is a continuous 
W function j n the profile of strategies P = (P,. P _,) £ A and 
concave in P, for Qa, Qi and Qi>. 

Definition 3. The best-response of player i is a function BRi : 
A-i Ai such that BRfiP^i) maximizes rj(Pj,P_j), 
subject to Pi £ Ai- 


We see that the Nash equilibrium is a fixed point of the 
best-response function. In the following we provide algorithms 
to obtain this fixed point for Qa- In Section [V] we will consider 
Qi and Q;>. Given other players’ power profile P ,, we use 
Lagrange method to evaluate the best response of player i. The 
Lagrangian function is defined by 

Ci{Pi,P-i) = n{Pi, P_t) + At (Pi - E h [P<(H)]J. 

To maximize CfiPi, P_j), we solve for P, such that = 

0 for each h £XL. Thus, the component of the best response of 
player i, BR,(P ,) corresponding to channel state h is given 
by 


BRi(P~p, h) = 


( l + J2^i\hij\ 2p j( h )) 
\hu\ 2 


max 


0, Ai(P_i) 


(6) 





where Aj(P_j) is chosen such that the average power con¬ 
straint is satisfied. 


It is easy to observe that the best-response of player i to 
a given strategy of other players is water-filling on f, (P_j) = 
(/*(P *; ft), h&U) where 




i 1 +J2j ¥ :i\ h ij\ 2P j( h )) 

| ft** | 2 


(7) 


For this reason, we represent the best-response of player i by 
WFj(P_j). The notation used for the overall best-response 
WF(P) = (WF(P(ft)),ft e H), where WF(P(ft)) = 
(WPi(P_i;ft),... 1 WP JV (P-^;ft)) and WF^P^-h) is as 
defined in (|6j. We use WFj(P_j) = (FFFj(P_j; ft), ft £ 'H). 

It is observed in ID that the best-response WFj(P^j) is 
also the solution of the optimization problem 


minimize ||Pj + fj(P_j)|| 2 , subject to Pj £ Ai. (8) 


As a result we can interpret the best-response as the projection 
of (—/*,i(P-*), ■ • ■, —/*,jv(P-*)) on to A- We denote the 
projection of x on to A % by 11^ (x). We consider ©, as 
a game in which every player minimizes its cost function 
||Pj + fj(P_j)|| 2 with strategy set of player i being A,. We 
denote this game by Q'. This game has the same set of NEs 
as Q because the best responses of these two games are 
equal. We now formulate the variational inequality problem 
corresponding to the game . 

Observe that ([8} is a convex optimization problem. Given 
P_i, a necessary and sufficient condition for P* to be a 
solution of the convex optimization problem of player i (CD, 
page 210) is given by 

E (W) + h )) (*iW - p *(h)) > o, (9) 

hen 

for all Xj e Ai. Thus, P* is a NE of the game Q' if © holds 
for each player i. We can rewrite the N inequalities in ([9} in 
compact form as 

(p* + ft + HP*y (x - P*) > 0 for all x £ A, (10) 


IV. Solving the VI for general channels 

In fTTl . we proved that if if = (/ + H) is positive 
semidefinite, then the fixed point iteration 

p(") =n^(P (n - 1) -rF(P (n_1) )), (11) 

converges to a NE. This condition is much weaker than one 
would obtain by using the methods in ID. In the current setup 
we aim to find a NE even if H is not positive semidefinite. 
For this, we present an algorithm to solve the VI(A,F) in 
general. 

We note that a solution P* of VI(A,F) satisfies 

p* = ru(p* -tF(p*)). (12 ) 

Thus, P* is a fixed point of the mapping T(P) = 
FU (P — F(P)). Using this fact, we reformulate the varia¬ 
tional inequality problem as a non-convex optimization prob¬ 
lem 

minimize ||P — II^ (P — F( P)) || 2 , 
subject to P £ A. (13) 

The feasible region A of P, can be written as a Cartesian 
product of Ai, for each i, as the constraints of each player 
are decoupled in power variables. As a result, we can split the 
projection 11^(.) into multiple projections Il^ i (.) for each i, 
i.e., ru(x) = (n^ 1 (xi),...,n^ JV (x iv )). For each player i, 
the projection operation I Uv, (x, ) takes the form 

11.4* (xj) = (max (0, Xi (ft) - Aj), ft £ U) , (14) 

where Aj is chosen such that the average power constraint is 
satisfied. Using (IT4l) . we rewrite the objective function in ( 1 1 31 
as 


||p-n 4 (P- J F(P))|| 2 = 

E (Pi (ft) - max{0,-/j(P_j; ft) - Aj}) 2 


hen,i 


ST' ( ■ fniu\ d" |ft*j| 2 ftj(ft) x 
= E 111111 ) P i\ h ), --Tt-f Aj 


hen,i 


| ft*; 


= E (min{Pj(ft),Pj(ft) +/j(P_j;ft) + Aj}) 2 . (15) 
heU,i 


where ft is a iVi-length block vector with N\ = \H.\, the 
cardinality of T~L, each block ft(ft),ft £ Ti, is of length N 


(l h n 


7, ■ 


Tftivivp 


and is defined by ft(ft) = 

block diagonal matrix H = diag |ftft(ft), ft £ T~L 
block iT(ft) defined by 


and H is the 
with each 


At a NE, the left side of equation (TiTl i is zero and hence each 
minimum term on the right side of the equation must be zero 
as well. This happens, only if 


Pi (ft) = 


0, if 


i+E Wi IM 2 hW 
IM 2 


+ Aj > 0, 


_ otherwise. 




0 

I ^*-7 I 
I/Mil 2 ’ 


if i = j, 
else. 


The characterization of Nash equilibrium in (flOt corre¬ 
sponds to solving for P in the variational inequality problem 
VI{A, F), 


F(P) T {x — P) > 0 for all x £ A, 
where F( P) = (I + H) P + ft. 


Here, the Lagrange multiplier Aj can be negative, as the 
projection satisfies the average power constraint with equality. 
At a NE Player i will not transmit if the ratio of total 
interference plus noise to the direct link gain is more than 
some threshold. 

We now propose a heuristic algorithm to find an optimizer 
of O. This algorithm consists of two phases. In the first 
phase, it attempts to find a better estimate of a power allocation 
using the fixed point iteration on the mapping T(P) that is 
close to a NE. For Qa this is algorithm (fTITl itself, which 









converges to the NE when H is positive semidefinite. When 
this condition does not hold, then we use it in Algorithm [T] to 
get a good initial point for the steepest descent algorithm of 
Phase 2. We will show in Section [Vl] that it indeed provides 
a very good initial point for Phase 2. For games Qj and Qd 
we will provide more justification for Phase 1 by showing that 
this corresponds to a better response dynamics. In the second 
phase, using the estimate obtained from Phase 1 as the initial 
point, the algorithm runs the steepest descent method to find a 
NE. It is possible that the steepest descent algorithm may stop 
at a local minimum which is not a NE. This is because of the 
non-convex nature of the optimization problem. If the steepest 
descent method in Phase 2 terminates at a local minimum 
which is not a NE, we again invoke Phase 1 with this local 
minimum as the initial point and then go over to Phase 2. We 
present the complete algorithm in Algorithm [1] 

In Section [Vl] we provide an example when H is positive 
semidefinite and use the algorithm in m to obtain a NE. We 
also use Algorithm [T| and obtain the same NE (which will be 
obtained from the first phase itself). Next we provide examples 
where H is not positive semidefinite. Thus the algorithm in 
sra may not converge. The present algorithm provides the 
NEs in just a few iterations of Phase 1 and Phase 2. 


Algorithm 1 Heuristic algorithm to find a Nash equilibrium 
Fix e > 0, S > 0 and a positive integer MAX 
Phase 1 : Initialization phase 

Initialize P|°' for all % 1...., A', 

for n = 1 -a MAX do 

p(ra) _ p(p("-l)) 

end for 

go to Phase 2. 


V. Partial Information games 

In partial information games, we can not write the problem 
of finding a NE as an affine variational inequality, because the 
best response is not water-filling and should be evaluated nu¬ 
merically. In this section, we show that we can use Algorithm 
□ to find a NE even for these information games. 


A. Game Q / 


We first consider the game Qi and find its NE using 
Algorithm □ We follow on similar lines as in Sections uni 
and UV] We write the variational inequality formulation of the 
NE problem. For user i, the optimization at hand is 


maximize r[ T \ subject to Pj £ A, 


(16) 


where = E/ li£ i 7r (^) E |_l°g [} + 

The necessary and sufficient optimality conditions for the 
convex optimization problem (IT6l > are 

(Xi - P*) T (-Virf ) (P*,P_ i )) > 0, for all x, £ A, (17) 


\hu \ 2 


i+e 3 ^ i M 2 p#<Hn 


where Vir^(P*,P_*) is the gradient of with respect to 
power variables of user i. Then P* is a NE if and only if 
{FT} is satisfied for all i = 1,... ,7V. We can write the N 
inequalities in (fl7T> as 


(x - P*) t F(P*) > 0, for all x £ A, (18) 


where F( P) = (—Vir^ (P),..., — (P)) t . Equation 

m is the required variational inequality characterization. A 
solution of the variational inequality is a fixed point of the 
mapping T/(P) = n^(P — tF( P)), for r > 0. We use 
Algorithm^ to find a fixed point of Tj(P) by replacing T(P) 
in Algorithm □ with T/(P). 


Phase 2 : Optimization phase 


B. Better response 


Initialize t = 1, P^ = P M/LY , 

loop 

For each i, Pf +1) = Steepest_Descent(P*^, *) 
where P® = (Pf +1) ,..., P^, Pf\ ..., P$), 

p(£+l) _ ^p(*+l) p(*+l)^ 

t = t+ 1, 

Till ||pW -T(PW)|| < e 

if ||P(‘) p(*+t) || < s and ||pW - T(P«)|| > e then 

Go to Phase 1 with p(°l = pW 

end if 
end loop 

function Steepest_Descent(P (4 1, i) 

V/(P (t) ) = ( dPi(h) |p=PW; h &H) 
where /(P) = ||P - T(P)|| 2 

for h £ W do 

evaluate |p_pw using derivative approxima¬ 

tion 

end for 

Pi t+1) = n^(pW- 7t v/(pW)) 

return Pf +1) 

end function 


In this subsection, we interpret T/(P) as a better response 
for each user. For this, consider the optimization problem (IThl i. 
For this, using the gradient projection method, the update rule 
for power variables of user i is 


,(n+l) 


= n A (Pf ) +rV i rf ) (pW)). 


(19) 


The gradient projection method ensures that for a given P 


(ra) 


.(/’(pln+HpA) > 


M) 


(P 


(n) p(n) 


Therefore, we can 


interpret p| n+1 ^ as a better response to P■ than P- n) . As 
the feasible space A = I A^ we can combine the update 
rules of all players and write 

p(n+i) = ^(pW - rF(P (n) )) = T/(P (n) ). (20) 


Thus, the Phase 1 of Algorithm^is the iterated better response 
algorithm. 


Consider a fixed point P* of the better response Tj( P). 
Then it implies that, given PLj, P* is a local optimum of 
m for a H i- Since the optimization (IT6t is convex, P* is 
also a global optimum. Thus given PLj, P* is best response 
for all i and hence NE is also a fixed point of the better 
response function. This gives further justification for Phase 
1 of Algorithm □ We could not provide this justification for 
Qa when H is not positive semidefinite. Indeed we will show 










in the next section that in such a case Phase 1 often provides 
a NE for Gi and Go (for which also Phase 1 provides a better 
response dynamics; see Section |V-DI below) but not for G a ■ 


C. Lower bound 


In the computation of NE, each user i is required to know 
the power profile P , of all other users. We now give a lower 
bound on the utility rj 11 of player i that does not depend on 
other players’ power profiles. 

We can easily prove that the function inside the expectation 
in r\ r> is a convex function of P j(hj) for fixed P i(hi) using 
the fact that f lfl6l > a function f : K. C R" —> R is convex if 
and only if 

dV(x + iy) >n 
dt 2 

for all x, y G /C and t G R is such that x + fy G /C. Then by 
Jensen’s inequality to the inner expectation in r \ 1 '', 


M) 


= 5Z n ( h i) E 


hi€l 


log 1 


\hu\ 2 Pi(h.i) 


> 7T ( h i) l °S ( 1 


hiex 


1 + 

IhaPPAhi) N 


= 7r ( /i *) 1 °g ( 1 


| ha\ 2 Pi(hi 


h^x 


1 + Ej^i \hij\ 2 Pj 


( 21 ) 


The above lower bound r^l B (P j) of (Pj, P_i) does not 
depend on the power profile of players other than i. We can 
choose a power allocation P, of player i that maximizes 
r i a h (Pi)- ^ i s the water-filling solution given by 


Pi (hi) = max < 0, A* - 


1 + E jjii \hij\ 2 Pj 

| hu \ 2 



Fig. 1. Sum rate comnarison at Nash eauilibrium noints for Exarrmle 1. 
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Fig. 2. Sum rate comparison at Nash equilibrium points for Example 2. 

and Jensen’s inequality as in ( |2T1 >. The optimal solution for 
the lower bound is the water-filling solution 


Pi(hu) 


max 



i + E 




E[| Hitf]P 2 


I hi, 


VI. Numerical Examples 


Let P* = (P*,PIj) be a NE, and let Pj be the maxi¬ 
mizer for the lower bound r P BB (Pi). Then, (P*, PtJ > 
AOrp r>* 1 f or a jj p. g i n particular for Pj = Pj. 


(Pi,P- 


Thus, rl^(Pr,Pt 4 ) > r< J) (P,t,PV)- But, (Pj,PV) > 


(Pi 


r (P 

' i,LB 


(Pj). Therefore, r- /) (P*,PA l ) > 


r (P 

' i,LB 


(Pj). But, in 


general it may not hold that (P*, Ptj) > (P|, pE). 


D. Game Qd 

We now consider the game Go where each user i has 
knowledge of only the corresponding direct link gain Ha. 
In this case also we can formulate the variational inequality 
characterization. The variational inequality becomes 


In this section we compare the sum rate achieved at a Nash 
equilibrium under the different assumptions on the channel 
gain knowledge, obtained using the algorithms provided above. 
In all the numerical examples, we have chosen r = 0.1 and 
the step size in the steepest descent method = 0.5 for t = 1 
and is updated after 10 iterations as 7*+io = ■ We choose 

a 3-user interference channel for Examples 1 and 2 below. 

For Example 1, we take T-Ld = {0.3,1} and T-L c = 
{0.2, 0.1}. We assume that all elements of TLd , T~Lc occur with 
equal probability, i.e., with probability 0.5. Now, the H matrix 
is positive definite and there exists a unique NE. Thus, the 
fixed point iteration (ITTI) converge to the unique NE for Ga ■ 
Algorithm |T| also converges to this NE not only for Ga but 
also for Qj and Go- 


(x - P*) t F d (P*) > 0, for all x G A, (22) 

where Fd(P) = (— Vir[ D ^ (P),..., — Vjvt^ (P)) t . We use 
Algorithm [I] to solve the variational inequality (122} by finding 
fixed points of Td(P) = II^(P — tFd( P)). Also, one can 
show that as for T/, To provides a better response strategy. 
We can also derive a lower bound on r\ ,}> using convexity 


We compare the sum rates for the NE under different 
assumptions in Figure Q] We have also computed Q = P 
that maximizes the corresponding lower bounds (EH . evaluated 
the sum rate s(Q) and compared to the sum rate at a NE. The 
sum rates at Nash equilibria for Gi and Go are close. This is 
because the values of the cross link channel gains are close and 
hence knowing the cross link channel gains has less impact. 



























allocation. 



Fig. 3. Sum rate comparison at Nash equilibrium points for Example 3. 

We now give a couple of examples in which H is not 
positive semidefinite and hence fixed point iteration (fm > fails 
to converge to a NE but Algorithm [j] converges to a NE for 
Qa, Gi and Qjj. 

For Example 2, we take 'H,i = {0.3,1} and 77,. = 
{0.1,0.5}. We assume that all elements of 77,/. 777 occur with 
equal probability. We compare the sum rates for the NE in 
Figures [2] Now we see significant differences in the sum rates. 

Consider a 2-user interference channel for Example 3. 
We take U d = {0.1,0.5,1} and U c = {0.25,0.5,0.75}. 
We assume that all elements of Hd,'H c occur with equal 
probability. In this example also, we use Algorithm Q] to find 
NE for the different cases and the lower bound. We compare 
the sum rates for the NE in Figures [3 We further elaborate 
on the usefulness of Phase 1 in Algorithm Q] We quantify the 
closeness of P to a NE by g( P) = ||P — T(P)||. If P is a NE, 
g(Pj = 0 and for two different power allocations P and Q, 
we say that P is closer to a NE than Q if g(P) < g{ Q). We 
now verify that the fixed point iterations in the initialization 
phase of Algorithm |T| takes us closer to a NE starting from 
any randomly chosen feasible power allocation. For this, we 
have randomly generated 100 feasible power allocations and 
run Phase 1 for MAX = 100 for each randomly chosen power 
allocation and compared the values of g(P). In the following, 
we compare the mean and standard deviation of the values of 
g(P) immediately after random generation of feasible power 
allocations to those after running the initialization phase for 
the 100 initial points chosen. 

For complete information game, in Example 1: (mean, 
standard deviation) of values of g(P) after random generation 
of feasible power allocations at lOdB and 15dB is (230.86, 3.8) 
and (659.22, 9.21) respectively. The (mean, standard deviation) 
for those samples after running the Phase 1 are (0.6260, 0.055) 
and (2.05, 0.166) respectively at lOdB and 15dB. Similarly in 
Example 2: (mean, standard deviation) of g(P) after random 
generation is (309.12, 4.4) and (950.01, 10.41) respectively at 
lOdB and 15dB and those after initialization phase are (9.63, 
1.83) and (32.26, 6.6). In Example 3: the mean and standard 
deviation of fj(P) immediately after the random generation 
are (101.85, 1.1140) at lOdB and (339.97, 9.68) at 15dB. 
The (mean, standard deviation) after running the Phase 1 are 
(0.83, 0.68) at lOdB and (2.82, 1.47) at 15dB. Thus, we can 
see that the phase 1 in Algorithm Q] provide a much better 
approximation to a NE than a randomly chosen feasible power 


For all the three examples, for Gi and Gr>, Phase 1 itself 
provides the NE. 

We have run Algorithm [I] on many more examples and 
found that it computed the NE, and for Gi and Go the Phase 
1 itself provided the NE. 

VII. Conclusions 

We have considered a channel shared by multiple 
transmitter-receiver pairs causing interference to each other. 
We have modeled this system as a non-cooperative stochastic 
game. Different transmitter-receiver pairs may or may not have 
channel gain information about other pairs’ channel gains. Ex¬ 
ploiting variational inequalities, we provide an algorithm that 
obtains NE in the various examples studied quite efficiently. 
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